Provenance in Databases: Why, How, and Where

Authors

  • James Cheney
  • Laura Chiticariu
  • Wang Chiew Tan
Abstract

Different notions of provenance for database queries have been proposed and studied in the past few years. In this article, we detail three main notions of database provenance and some of their applications, and compare and contrast them. Specifically, we review why, how, and where provenance, describe the relationships among these notions, and describe some of their applications in confidence computation, view maintenance and update, debugging, and annotation propagation.

Similar resources

Improv: Flexible Data Provenance for Relational Databases

Curated databases, which consist of data extracted from original sources, printed articles, and other databases, are a valuable source of data for scientists. However, as curated databases aggregate information from multiple sources, the origin of the data elements can be lost. Because of this, curated databases often provide support for data annotations, which are pieces of extra information a...

Why and Where: A Characterization of Data Provenance

With the proliferation of database views and curated databases, the issue of data provenance (where a piece of data came from and the process by which it arrived in the database) is becoming increasingly important, especially in scientific databases where understanding provenance is crucial to the accuracy and currency of data. In this paper we describe an approach to computing provenance when...

Integrating Approximate Summarization with Provenance Capture

How to use provenance to explain why a query returns a result or why a result is missing has been studied extensively. Recently, we have demonstrated how to uniformly answer these types of provenance questions for first-order queries with negation and have presented an implementation of this approach in our PUG (Provenance Unification through Graphs) system. However, for realistically sized data...

On Answering Why-Not Queries Against Scientific Workflow Provenance

Why-not queries help scientists understand why a given data item was not returned by the executions of a given workflow. While answering such queries has been investigated for relational databases, there is only one proposal in this area for workflow provenance, viz. the Why-Not algorithm. This algorithm makes the assumption that the modules implementing the steps of the workflow preserve the attr...

GProM - A Swiss Army Knife for Your Provenance Needs

We present an overview of GProM, a generic provenance middleware for relational databases. The system supports diverse provenance and annotation management tasks through query instrumentation, i.e., compiling a declarative frontend language with provenance-specific features into the query language of a backend database system. In addition to introducing GProM, we also discuss research contribut...

Journal:
  • Foundations and Trends in Databases

Volume 1  Issue 

Pages  -

Publication date: 2009